Polynucleotides encoding monomeric variants of the tetrameric eqFP611

ABSTRACT

DNA encoding a monomeric variant of red fluorescent protein eqFP611 comprising an amino acid sequence selected from the group consisting of SEQ ID No. 1, SEQ ID No. 3 and SEQ ID No. 5. DNA comprising a nucleotide sequence selected from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and SEQ ID No. 6.

TECHNICAL FIELD

This disclosure relates to monomeric variants of the red fluorescentprotein eqFP611 (Genbank accession number AY130757) exhibiting acomparatively large Stokes shift in the orange red region of thespectrum between 557 and 630 nm, and their versatile fields ofapplication in biology, in particular live-cell imaging.

BACKGROUND

Fluorescent proteins (FPs) are powerful, specific marker tools forcellular imaging, and their range of applications is continuouslyexpanding (1). Red fluorescent proteins (RFPs) are of particularinterest as they extend the color palette for multi-channel and FRET(fluorescence resonance energy transfer) imaging, and the reducedscattering of long-wavelength light makes them attractive as markers fordeep-tissue imaging. A performance gap has been noticed for natural RFPsafter isolation from marine invertebrates (2-4), in comparison tovariants of the classical green fluorescent protein (GFP) (1). Inparticular, their tendency to form dimers or tetramers can bedetrimental for fusion marker applications (4-6). Therefore, an entire“fruit basket” of monomeric FPs in many different hues was engineeredfrom the tetrameric RFP DsRed (1, 7-8). Together with monomeric variantsof GFP, eqFP578 and several reef coral proteins, the emission colors ofthese FPs cover a wide range from blue to red (1, 6, 9-10). Brightfar-red fluorescent markers with emission maxima shifted up to 639 nmwere developed on the basis of the Entacmaea quadricolor proteinseqFP578 (11).

Important FP applications are in biosensors that report on intracellularconditions via fluorescence resonance energy transfer (FRET) between twoor more chromophors (1, 9-10). A large Stokes shift, i.e., a largewavelength separation between excitation and emission peaks assists inchannel separation and facilitates the use of FPs in FRET-basedapplications, in multi-color imaging application and in whole-body anddeep tissue imaging. Several red fluorescent proteins with Stokesshifts>47 are characterized by excitation/emission peaks beyond 582/629nm. In contrast, there is no fluorescent protein known which exhibits acomparably large Stokes shift in the orange red region of the spectrumbetween 557 and 630 nm.

Accordingly, it could be helpful to provide novel monomeric fluorescentproteins having a comparably large Stokes shift in the orange red regionof the spectrum between 557 and 630 nm, in particular to facilitatebiological imaging or imaging applications.

SUMMARY

We provide a monomeric variant of red fluorescent protein eqFP611including an altered amino sequence having at least one amino acidexchange selected from the group consisting of M12K, M15L, Y22H, D31E,V46I, K67R, H72Y, T73P, F102I, K120Q, L147M, S158T, Q159H, N163K, Y169H,E185G, F187I, F192I, F194A, E175V, K207N, H214R, C222A, D223G, P225G,S226G and K227G.

We also provide a monomeric variant of red fluorescent protein eqFP611including an amino acid sequence selected from the group consisting ofSEQ ID No. 1, SEQ ID No. 3 and SEQ ID No. 5.

We further provide a fusion protein including a protein of interestfused to the monomeric variant of the red fluorescent protein eqFP611.

We still further provide a composition including the monomeric variantof the red fluorescent protein eqFP611 in a biologically compatiblemedium.

We further yet provide DNA encoding a monomeric variant of redfluorescent protein eqFP611 including an amino acid sequence selectedfrom the group consisting of SEQ ID No. 1, SEQ ID No. 3 and SEQ ID No.5.

We also further provide DNA including a nucleotide sequence selectedfrom the group consisting of SEQ ID No. 2, SEQ ID No. 4 and SEQ ID No.6.

We yet again provide an expression vector including expression controlsequences operably linked to the DNA.

We further yet again provide a host cell including the DNA.

We also yet again provide a fluorescent reagent kit including at leastone of a) DNA of SEQ ID No. 2, 4 or 6, b) an expression vector havingDNA of SEQ ID No. 2, 4 or 6, and c) a host cell having DNA of SEQ ID No.2, 4 or 6.

We still yet again provide a method of preparing a monomeric variant ofred fluorescent protein eqFP611 including a) cultivating the host cellso that the host cell produces a monomeric variant of the redfluorescent protein eqFP611, and b) isolating and purifying themonomeric variant so produced by the host cell.

We also provide a method of selecting cells expressing a protein ofinterest including (a) introducing into cells DNA encoding a protein ofinterest and DNA encoding the monomeric variant of the red fluorescentprotein eqFP611, (b) cultivating the cells from (a) under conditionspermitting expression of the monomeric variant and the protein ofinterest, and (c) selecting cultivated cells from (b) which express themonomeric variant.

We also provide a method of measuring expression of a protein ofinterest in a cell including (a) introducing into a cell DNA encodingthe monomeric variant of the red fluorescent protein eqFP611 fused toDNA encoding a protein of interest, the DNA being operably linked to andunder control of an expression control sequence which moderatesexpression of the protein of interest, (b) cultivating the cell underconditions suitable for expression of the protein of interest, and (c)detecting fluorescence emission of the monomeric variant to measureexpression of the protein of interest.

We also provide a method of determining localization of a protein ofinterest including (a) introducing into a cell DNA encoding themonomeric variant of the red fluorescent protein eqFP611 fused to a DNAencoding a protein of interest, the DNA being operably linked to andunder control of a suitable expression control sequence, (b) cultivatingthe cell under conditions suitable for expression of the protein ofinterest; and (c) determining localization of the protein of interest bydetecting fluorescence emission of the monomeric variant by opticalmeans.

We also provide a method of localizing a subcellular structure in a cellincluding (a) introducing into a cell DNA encoding the monomeric variantof the red fluorescent protein eqFP611 fused to a DNA encoding a proteinspecific for a subcellular structure, the DNA being operably linked toand under control of an expression control sequence which moderatesexpression of the protein specific for a subcellular structure, (b)cultivating the cell under conditions suitable for expression of theprotein specific for a subcellular structure; and (c) detectingfluorescence emission of the monomeric variant to localize thesubcellular structure in the cell.

We also provide a method of comparing an effect of one or more testsubstance(s) on expression and/or localization of a protein of interestincluding (a) introducing into a cell DNA encoding the monomeric variantof the red fluorescent protein eqFP611 fused to a DNA encoding a proteinof interest, the DNA being operably linked to and under control of asuitable expression control sequence, (b) cultivating the cell underconditions suitable for expression of the protein of interest in thepresence and absence of the test substance(s), (c) determiningexpression and/or localization of the protein of interest by detectingfluorescence emission of the monomeric variant by optical means, and (d)comparing the fluorescence emission obtained in the presence and absenceof the test substance(s) to determine the effect of the testsubstance(s) on the expression and/or localization of the protein ofinterest.

We also provide a method of determining promoter activity in a cellincluding (a) introducing into a cell a vector that is constructed suchthat DNA encoding the monomeric variant of the red fluorescent proteineqFP611 is located downstream of a promoter to be tested, and (b)detecting fluorescence of the monomeric variant to determine activity ofthe promoter.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1A shows subcellular localization in HEK293 cells with eqFP611dimer 1 (RFP611, T122R) (13) evenly distributed over the cell.

FIG. 1B shows subcellular localization in HEK293 cells with the enhancedeqFP611 dimer 2 (T122G, F194A) in granular localization.

FIG. 1C is a phase contrast image of cells co-expressing EGFP-SKL and anenhanced monomeric eqFP611 (T122R, F194A) variant.

FIG. 1D is a green-channel image of cells co-expressing EGFP-SKL and anenhanced monomeric eqFP611 (T122R, F194A) variant.

FIG. 1E is red-channel image produced with a standard TRITC filter setof cells co-expressing EGFP-SKL and an enhanced monomeric eqFP611(T122R, F194A) variant.

FIG. 1F is an overlay image of the green and red channels of cellsco-expressing EGFP-SKL and an enhanced monomeric eqFP611 (T122R, F194A)variant.

FIG. 2A shows in vitro chromophore maturation at 37° C. as determinedfrom the fluorescence emission at 605 nm (solid line: exponential fit).

FIG. 2B shows elution volume for concentrations of 4-30 mg/ml (0.16-1.2mM) during gel-filtration analysis (standards for the oligomerizationdegree: GFP S65T, monomer; td-RFP611, dimer; eqFP611, tetramer).

FIG. 2C shows absorption, excitation and emission spectra (inset:fluorescence decay with exponential fit yielding a lifetime of2.6.+−.0.1 ns).

FIG. 2D shows absorption spectra at various pH values (inset: absorptionat 558 nm versus pH; Henderson-Hasselbalch fit reveals a pKa of 4.4).

FIG. 2E shows the increasing absorption at about 390 nm along with thedecrease of the about 463 nm band of the protonated chromophore of themonomeric eqFP611 variant indicates the destruction of the acyliminebond of the chromophore over a period of 3.5 h time at pH 1.3.

FIG. 2F shows changes in the absorption spectrum of the monomericeqFP611 variant during a 1-h time course of alkaline denaturation at pH13.

FIG. 2G shows time-dependent fluorescence decay of the monomeric eqFP611variant at pH 13 in comparison to other monomeric fluorescent proteins,measured, at their emission maxima.

FIG. 3A shows an epifluorescence image of NIH3T3 cells expressing themonomeric eqFP611 variant fused to human α-tubulin. Two cells from asingle image were arranged in closer proximity.

FIG. 3B shows a spinning-disk confocal image of the endoplasmicreticulum of a HeLa cell stained with ER-monomeric-eqFP611-variant-KDEL(“KDEL” disclosed as SEQ ID No. 7). False colors encode fluorescenceintensity.

FIG. 4A is a confocal image (Leica TCS 4Pi) of HEK293 cellsco-expressing monomeric eqFP611 variant-PTS and H₂B-EGFP duringinterphase. Bars: 2 μm.

FIG. 4B is a confocal image (Leica TCS 4Pi) of HEK293 cellsco-expressing monomeric eqFP611 variant-PTS and H₂B-EGFP duringmetaphase. The inset in shows a magnification of a peroxisomal cluster.Bars: 2 μm.

FIG. 4C is a confocal image (Leica TCS 4Pi) of HEK293 cellsco-expressing monomeric eqFP611 variant-PTS and H₂B-EGFP with tubulinfibers labeled in green with Alexa Fluor®-488 via an anti-α-tubulinantibody to reveal peroxisomal clustering at the spindle poles. Bars: 2μm.

FIG. 5 shows the following: Inheritance of peroxisomes. Distribution ofperoxisomes during different stages of mitosis in HeLa (upper row) andNIH3T3 cells (lower row). Peroxisomes are highlighted by monomericeqFP611 variant-PTS. The spindle of the mitotic cells is stained withEGFP fused to a tubulin-binding protein [13]. Images were taken 24-48 hafter transfection on an Olympus IX71 microscope equipped with standardFITC and TRITC filter sets. Bars: 5 μ·m.

FIG. 6A shows a comparison of ER-staining in HeLa cells expressingER-EGFP-KDEL (“KDEL” disclosed as SEQ ID No. 7) (left column),ER-monomeric eqFP611 variant-KDEL (“KDEL” disclosed as SEQ ID No. 7)(right column) or both (middle column). Imaging of the EGFP fluorescencein the green channel (upper row) required exposure times of 4.0 swhereas fluorescence of the monomeric eqFP611 variant in the red channelcould be already recorded with exposure times of 0.4 s using an OlympusIX71 microscope equipped with standard FITC (green channel) and TRITC(red channel) filter sets. Bars: 2 μm.

FIG. 6B shows a magnified view of a HeLa cell expressing ER-monomericeqFP611 variant-Kdel (“Kdel” disclosed as SEQ ID No. 7) photographed onthe Olympus IX71 microscope with TRITC filter. Bars: 2 μm.

FIG. 7 displays a multiple sequence alignment of monomeric redfluorescent proteins and their tetrameric predecessors (SEQ ID Nos. 23,1 and 24-29, respectively, in order of appearance). Modified amino acidresidues are highlighted in different colors; cyan: residues involved ininterface interactions; green: residues in the interior of the.beta.-can; magneta: residues interacting with the chromophore; yellow:external residues; gray: GFP-termini of mCherry. The chromophore isunderlined; .beta.-strands are indicated by arrows.

DETAILED DESCRIPTION

We provide a monomeric variant of the red fluorescent protein eqFP611having an altered amino sequence featuring at least one of the aminoacid exchanges selected from the group consisting of M12K, M15L, Y22H,D31E, V46I, K67R, H72Y, T73P, F102I, K120Q, L147M, S158T, Q159H, N163K,Y169H, E185G, F187I, F192I, F194A, E175V, K207N, H214R, C222A, D223G,P225G, S226G and K227G.

The numbering of the above listed amino acid exchanges are based on theamino acid sequence of eqFP611 deposited under genbank accession numberAY130757. Further, amino acids are described using the single lettercode. In this regard, for instance the term “M12K” means that the aminoacid M (methionine) at position 12 from the eqFP611 has been exchangedby the amino acid K (lysine).

More specifically, we provide a monomeric variant of the red fluorescentprotein eqFP611 having an amino acid sequence selected from the groupconsisting of SEQ ID No. 1, SEQ ID No. 3 and SEQ ID No. 5. In thisregard, the monomeric rqFP611 variant as set forth in SEQ ID No. 3 is acodon optimized variant of the monomeric eqFP611 variant as set forth inSEQ ID No. 1. The monomeric eqFP611 variant as set forth in SEQ ID No. 5carries the C-terminal tail of eqFP611 as a natural peroxisomaltargeting signal.

The monomeric red fluorescent proteins are created from the tetramericred fluorescent protein eqFP611 from the sea anemone Entacmaeaquadricolor (Genbank accession number AY130757). The monomeric eqFP611variants are in particular useful as labeling substances, preferably forlive-cell imaging as well for imaging of fixed cells. The fluorescentproteins are characterized by advantageous properties such as redfluorescence with a large Stokes shift, monomeric nature, functionalexpression at 37° C., functional localization in the endoplasmicreticulum of mammalian cells, functional expression in fusion withtubulin in NIH3T3 cells, superior stability of fluorescence in cellsfixed with paraformaldehyde and superior stability at extreme pH values.Moreover, a variant is presented that allows labeling of peroxisomes ineukaryotic cells.

The F102I variant of the tetrameric eqFP611 was chosen as the startingmaterial for the development of a monomeric variant owing to itsexcellent expression at 37°. Introduction of amino acids₁₂₂ arginine and₁₉₄alanine, which were known to disrupt the A/B and A/C subunitinteractions in tetrameric DsRed and eqFP611 (7, 12), resulted in anearly complete loss of fluorescence. In several rounds of random andmulti-site-directed mutagenesis, the fluorescence was recovered byreplacing amino acids that apparently impeded proper folding andmaturation of the chromophor. In HEK293 cells transfected with thesemonomeric eqFP611 variants, the red fluorescence was not evenlydistributed but rather appeared as dot-like structures in the cytoplasm.This localization resembled closely the aggregates reported for severalnon-soluble GFP-like proteins such as dsRed. In-depth analysis of thesequence and observation of the behavior of the “aggregates” during thecell cycle revealed that exposure of the A/C interface duringmonomerization exposed a subcellular targeting signal that directs theprotein in cellular organelles, namely the amino acid sequence ₂₂₉GRL₂₃₁at the C terminus of the monomeric eqFP611 variant as set forth in SEQID No. 5, and also the preceding triplet ₂₂₆SKL₂₂₈ serve as (type-1)peroxisomal targeting signal (PTS) (13). The presence of a functionalPTS was verified by imaging HEK293 cells co-transfected with monomericeqFP611 as set forth in SEQ ID No. 5 and EGFP-SKL, an establishedperoxisomal marker. Both green and red fluorescence showed perfectco-localization (FIG. 1). We succeeded in removing peroxisomal targetingby replacing the C-terminal sequence ₂₂₂CDLPSKLGRL₂₃₁ (SEQ ID No. 8) ofeqFP611 by ₂₂₂AGLGGG₂₂₇ (SEQ ID No. 9); amino acid modification and eventruncation of the ₂₂₉GRL₂₃₁ tripeptide did not suffice (SupplementalTable 1). After completion of seven rounds of random mutagenesis andfour rounds of multi site-directed mutagenesis, a bright red-fluorescentmonomer as set forth in SEQ ID No. 1 was finally obtained. Compared towild-type eqFP611, it contains altogether 28 amino acid replacements andis shorter by four amino acids. Additional mutations may be introduced.For instance at least one amino acid exchange selected from the groupconsisting of N2Y/K/D, S3P, L4M, K6Q/E, NS8K, M11T, M12K/V/R/L, M15V/L,E16D, Y22H, T27I, E29K, D31E, N33S/K, Y35F, M36L/V/K, K44N/R/Q, V46I/A,I57V, K67R, H72Y, T73S/P, I76V/F, S83T, D98E, F102I/L/V, V104A, M105I,E111V, D112G, C114S, Y117H, A119V, K120R/Q, T122R/A/G/M, S128A/P,N129H/D, Q134R, K136R, M146I, L147M, Y148C, D151N, Y157H/F, S158T,Q159H, A161V, N163K/R, D165E, Y169H, S171F/Y, S173F, E175V; T183S/A,V184D/I, E185G, N186Y, F187I/L, K188Q/E, G191D, F192I/Y, F194A/V/T,H197Y, E204D, S205R, D206G, K207D/N/Q/E, M209K/T, Q213L, H214R, H216R/Y,V218E, F221I/S/L/Y, C222W/Y/G/C, L224P, P225S and L231P may beintroduced in a monomeric variant of the red fluorescent proteineqFP611. These amino acid sequences may further improve folding ofmonomeric eqFP611 variants. To further improve the performance of themonomeric eqFP611 variant as set forth in SEQ ID No. 1 in live-cellimaging applications, the codon usage of the final variant was alteredto facilitate expression. By this method, the fluorescence in mammaliancells could be increased by 5-8 fold. The codon optimized variant as setforth in SEQ ID No. 3 is particularly useful for labeling purposes insensitive systems such as primary cell cultures or stem cells.

The following properties relate to the monomeric eqFP611 variant as setforth in SEQ ID No. 1 and 3. At 37° C., fluorescence spectrometry ondilute solutions of the purified protein yields a half-maturation timeof 2.8 h, which makes the protein suitable for most cell biologicalimaging applications (FIG. 2A). A tendency to dimerize was absent in theentire range of physiologically relevant concentrations (FIG. 2B). Withexcitation and emission maxima at 558 nm and 605 nm, respectively, theeqFP611 variant combines a monomeric nature with a uniquely large Stokesshift in the orange-red emission region (FIG. 2C, Supplementary Table2). Its fluorescence lifetime of 2.6±0.1 ns is comparatively long for amonomeric red fluorescent protein (6), and together with its highquantum yield (0.35) and molar extinction coefficient of 112,000 M⁻¹cm⁻¹, the monomeric eqFP611 variant appears as superb marker in the redspectral range. It may be especially useful for FRET applications, whereit bridges the gap between yellow-orange and far-red fluorescentproteins. The photobleaching probability of the monomeric eqFP611variant (5.3×10⁻⁶) is very similar to the one of its predecessor RFP611(4.9×10⁻⁶). The engineering process has significantly improved thestability of the monomeric eqFP611 variant at pH extremes as compared totetrameric eqFP611. Below pH 5, the monomeric eqFP611 variantchromophore becomes protonated, as indicated by the increasingabsorption band at 463 nm (FIG. 2D), but the protein remains stable evenat pH 3. In contrast, the eqFP611 chromophore decomposes quickly at pH3, as witnessed by the appearance of a 390-nm absorption band indicativeof the neutral GFP (Green Fluorescent Protein) chromophore. At pH 13,the fluorescence of a variety of monomeric fluorescent proteinsincluding EGFP (Enhanced Green Fluorescent Protein) disappears withinseconds, whereas the fluorescence of the monomeric eqFP611 variant takesabout an hour to vanish completely (FIGS. 2E-G). This stability at pHextremes makes the monomeric eqFP611 variant suitable for imaging ofcellular organelles with pH values deviating from the standardphysiological pH around pH 7. The stability of the monomeric eqFP611variant is further beneficial for imaging of cells fixed withparaformaldehyde: The fluorescence of cells fixed in paraformaldehydeafter expressing the state-of-the-art red fluorescent marker protein“mCherry” steadily faded in course of three weeks. In contrast, thefluorescence of cells expressing the monomeric eqFP611 variant slightlyincreased under the same experimental conditions. This property makesthe monomeric eqFP611 variant an ideal marker for applications that relyon tracking of labeled cells or proteins in tissue fixed withparaformaldehyde.

To examine the performance of the monomeric eqFP611 variant (as setforth in SEQ ID No. 1 or 3) as a marker in live-cell imagingapplications, it was fused to the N-terminus of α-tubulin becauseimaging of microtubules with such a fusion construct is known to be verysensitive to oligomerization, aggregation tendency and C-terminal aminoacid linker properties of the marker protein (8). FIG. 3A exemplifies anexcellent performance of the monomeric variant in highlighting tubulinfibers. Another challenging application, which requires a high stabilityof the marker protein, is its localization in the endoplasmic reticulum(ER). The construct ER-monomeric eqFP611 variant-KDEL yielded excellentimages of the ER with only 0.4 s light exposure in both epifluorescenceand confocal microscopy modes (FIG. 3B, FIGS. 6A-B), whereas the popularmarker protein EGFP targeted to the ER (ER-EGFP-KDEL) required muchlonger exposure times under comparable conditions (FIGS. 6A-B).

Peroxisomes are the organelles that have been discovered last, anddespite their essential functional roles in eukarytotic cells (13), manyaspects of their biology are still not well understood (13). The presentobservation with the early monomeric eqFP611 variant as set forth in SEQID No. 5 that highlighted two peroxisome clusters during mitosis ofHEK293 cells suggests an ordered partitioning of this organell todaughter cells. This finding contrasts the commonly held view thatperoxisomes are distributed stochastically between dividing mammaliancells (14). Hence, it was further explored the issue of peroxisomeinheritance in mammalian cells. To this end, the original C-terminalsequence of eqFP611 in the monomeric eqFP611 variant as set forth in SEQID No. 5 was restored to attain an efficient peroxisomal marker(monomeric eqFP611-PTS). HEK293 cells were co-transfected with monomericeqFP611-PTS and the chromatin-binding histone 2B protein (H2B) fused toEGFP (1) so as to simultaneously image peroxisomes (in red) andchromatin (in green). During interphase, peroxisomes were seen to beevenly distributed over the cytoplasm (FIG. 2C). However, at thebeginning of metaphase, peroxisomes began to congregate in two clusterson the bottom and top of the emerging metaphase plate (FIG. 2D).Additional immunochemical staining of the mitotic spindle revealed thatthe peroxisomes clustered around the spindle poles (FIG. 2E). By thisordered partitioning strategy, the cells can ensure that equal numbersof peroxisomes are inherited by the daughter cells. It was furtherexamined this mechanism with two other mammalian cell lines. HeLa andNIH3T3. In HeLa cells, peroxisomes were also closely associated with thespindle pole during all stages of cell division (FIG. 5). By contrast,NIH3T3 cells showed these peroxisomal associations to the centrosomesonly during two stages of cell division, anaphase and telophase.

We also provided a monomeric variant of the red fluorescent proteinhaving an amino acid sequence comprising a deletion, substitution and/oraddition of one or several amino acids with respect to the amino acidsequence selected from the group consisting of SEQ ID No. 1, 3 and 5 andhaving red fluorescent properties.

The range of “1 to several” in “an amino acid sequence comprising adeletion, substitution, and/or addition of one or several amino acids”is not particularly limited but means, for example, about 1 to 20,preferably 1 to 10, more preferably 1 to 7, furthermore preferably 1 to5, and particularly preferably 1 to 3.

We also provide a fusion compound, preferably a fusion protein,comprising a protein of interest fused to a monomeric variant of the redfluorescent protein eqFP611. The type of protein of interest with whichthe monomeric eqFP611 variant is fused is not particularly limited.Preferred examples may include proteins localizing in cells, proteinsspecific for intracellular structures, in particular intracellularorganelles and targeting signals (e.g., a nuclear transport signal, amitochondrial presequence and the like). The obtained fusion proteinwherein the monomeric eqFP611 variant is fused with a protein ofinterest is allowed to be expressed in cells. By monitoring afluorescence emitted it becomes possible to analyze the activity,localization, processing, or dynamics of the protein of interest incells. That is, cells transformed or transfected with DNA encoding themonomeric eqFP611 variant are observed with the fluorescence microscope,so that the activity, localization, processing and dynamics of theprotein of interest in the cells can visualized and thus analyzed.

The term “protein of interest” is intended also to encompasspolypeptides and peptide fragments. Examples of such proteins ofinterest include: NFκB and subunits thereof, RAC1, PLC domains, MAPKAP2,PKC, Cytochrome C, RHO, β-actin, α-tubulin, STAT6 and protein kinase Cisotypes.

We further provide a composition comprising a monomeric eqFP611 variantand a suitable biologically compatible medium. Examples of suitablebiologically compatible media include: Nutritive media, buffersolutions, pharmaceutically acceptable carriers.

We still further provide DNA encoding a monomeric variant of the redfluorescent protein eqFP611 having an amino acid sequence selected fromthe group consisting of SEQ ID No. 1, SEQ ID No. 3 and SEQ ID No. 5.

The DNA may have a nucleotide sequence selected from the groupconsisting of SEQ ID No. 2, SEQ ID No. 4 and SEQ ID No. 6.

Furthermore, we address a nucleotide sequence comprising a deletion,substitution and/or addition of one or several nucleotides with respectto the nucleotide sequence as set forth in SEQ ID No. 2, 4 or 6. Therange of “1 to several” in “a nucleotide sequence comprising a deletion,substitution and/or addition of one or several nucleotides” is notparticularly limited but means, for example, about 1 to 50, preferably 1to 30, more preferably 1 to 20, furthermore preferably 1 to 10, andparticularly preferably 1 to 5.

The DNA may be prepared synthetically by establishes methods, e.g., thephosphoramidite method (15, 16). According to the phosphoramiditemethod, oligonucleotides are synthesized, e.g., in an automatic DNAsynthesizer, purified, annealed, ligated and cloned into suitablevectors. The DNA may also be prepared by recombinant DNA methodology(17). The DNA may also be prepared by polymerase chain reaction (PCR)(18).

The method of introducing a desired mutation into a predeterminednucleotide sequence is known. For example, DNA having the mutation canbe constructed by appropriately using known techniques such as sitespecific mutagenesis method, PCR using degenerate oligonucleotides, andexposure of cells containing the nucleic acid to a mutagen or radiation(19, 20).

The DNA sequence encoding a monomeric eqFP611 variant may be joinedin-frame with a gene encoding a protein of interest and the desiredfusion protein produced when inserted into an appropriate expressionvector. For example, polymerase chain reaction or complementaryoligonucleotides may be employed to engineer a polynucleotide sequencecorresponding to a monomeric eqFP611 variant, 5′ or 3′ to the genesequence corresponding to the protein of interest. Alternatively, thesame techniques may be used to engineer a polynucleotide sequencecorresponding to the sequence of a monomeric eqFP611 variant 5′ or 3′ tothe multiple cloning site of an expression vector prior to insertion ofa gene sequence encoding the protein of interest. The polynucleotidesequence corresponding to the sequence of a monomeric eqFP611 maycomprise additional nucleotide sequences to include cloning sites,linkers, transcription and translation initiation and/or terminationsignals, labeling and purification tags.

In a further aspect, there is provided an expression vector comprisingsuitable expression control sequences operably linked to a DNA molecule.The DNA may be inserted into a recombinant vector, which may be anyvector that may be conveniently be subjected to recombinant DNAprocedures. The choice of vector will often depend on the host cell intowhich it is to be introduced. Thus, the vector may be an autonomouslyreplicating vector, e.g., a vector which exists as an extrachromosomalentity, the replication of which is independent of chromosomalreplication, e.g., a plasmid. Alternatively, the vector may be onewhich, when introduced into a host cell, is integrated into the hostcell genome and replicated together with the chromosome(s) into which ithas been integrated.

The vector is preferably an expression vector in which the DNA sequenceencoding a monomeric eqFP611 variant is operably linked to additionalsegments required for transcription of the DNA. In general, theexpression vector is derived from plasmid or viral DNA, or may containelements of both. The term, “operably linked” indicates that thesegments are arranged so that they function in concert for theirintended purposes, e.g., transcription initiates in a promoter andproceeds through the DNA sequence coding for a monomeric eqFP611variant. The promoter may be any DNA sequence which showstranscriptional activity in a suitable host cell of choice, (e.g., abacterial cell, a mammalian cell, a yeast cell, or an insect cell) forexpressing a monomeric eqFP611 variant of the invention. The promotermay be derived from genes encoding proteins either homologous orheterologous to the host cell.

Examples of suitable promoters for directing the transcription of theDNA sequence encoding the monomeric eqFP611 variant are the CMVpromoter, Ubiquitin C promoter. SV40 promoter and MT-1 (metallothioneingene) promoter. An example of a suitable promoter for use in insectcells is the polyhedron promoter. Examples of suitable promoters for usein yeast host cells include promoters from yeast glycolytic genes oralcohol dehydrogenase genes or ADH2-4c promoters. Examples of suitablepromoters for use in bacterial host cells include the promoter of theBacillus stearothermophilus maltogenic amylase gene, the Bacilluslicheniformis alpha-amylase gene, the Bacillus amyloliquefaciens BANamylase gene, the Bacillus subtilis alkaline protease gene, or theBacillus pumilus xylosidase gene, or the phage Lambda PR or PL promotersor the Escherichia coli lac, trp or tac promoters.

The DNA sequence encoding a monomeric eqFP611 variant may also, ifnecessary, be operably connected to a suitable terminator, such as thehuman growth hormone terminator or the TPI1 or ADH3 terminators. Thevector may further comprise elements such as polyadenylation signals(e.g., from SV40 or the adenovirus 5 E1b region), transcriptionalenhancer sequences (e.g., the SV40 enhancer) and translational enhancersequences (e.g., the ones encoding adenovirus VA RNAs).

The recombinant vector may further comprise a DNA sequence enabling thevector to replicate in the host cell in question. An example of such asequence (when the host cell is a mammalian cell) is the SV40 origin ofreplication. When the host cell is a yeast cell, examples of suitablesequences enabling the vector to replicate are the yeast plasmid 2μreplication genes REP 1-3 and origin of replication. The vector may alsocomprise selectable markers, such as a gene that confers resistance to adrug, e.g., ampicillin, kanamycin, tetracyclin, chloramphenicol,puromycin, neomycin or hygromycin.

A further aspect relates a host cell having the DNA. Preferably, thehost cell is transformed or transfected with a DNA construct comprisingan expression vector. The DNA construct or the recombinant vector issuitably introduced into a host cell which may be any cell which iscapable of expressing the present DNA construct and includes bacteria,yeast and higher eukaryotic, in particular mammalian, cells.

Examples of bacterial host cells which, on cultivation, are capable ofexpressing the DNA construct are Gram-positive bacteria, e.g., speciesof Bacillus or Gram-negative bacteria such as E. coli. Thetransformation of the bacteria may be effected by using competent cellsin a known manner.

Examples of suitable mammalian cell lines are the HEK293 and the HeLacell lines, primary cells, and the COS (e.g., ATCC CRL 1650), BHK (e.g.,ATCC CRL 1632, ATCC CCL 10), CHL (e.g., ATCC CCL 39) or CHO (e.g., ATCCCCL 61) cell lines. Methods of transforming or transfecting mammaliancells and expressing DNA sequences introduced in the cells are known(21-23).

Examples of suitable yeast cells include cells of Saccharomyces spp. orSchizosaccharomyces spp., in particular strains of Saccharomycescerevisiae or Saccharomyces kluyveri. Transformed cells may be selectedby a phenotype determined by a selectable marker, commonly drugresistance or the ability to grow in the absence of a particularnutrient, e.g., leucine. A preferred vector for use in yeast is the POT1vector. The DNA sequence encoding a monomeric eqFP611 variant may bepreceded by a signal sequence and optionally a leader sequence, e.g., asdescribed above. Further examples of suitable yeast cells are strains ofKluyveromyces, such as K. lactis, Hansenula, e.g., H. polymorpha, orPichia, e.g., P. pastoris (24).

There is further provided a fluorescent reagent kit comprising at leastone of: a) the DNA of SEQ ID No. 2, 4 or 6 b) an expression vectorhaving the DNA of SEQ ID No. 2, 4 or 6 and c) a host cell having the DNAof SEQ ID No. 2, 4 or 6.

Further, there is provided a method for preparing a monomeric variant ofthe red fluorescent protein eqFP611, the method comprising:

-   -   (a) cultivating a host cell so that the host cell produces a        monomeric variant of the red fluorescent protein eqFP611; and    -   (b) isolating and purifying said monomeric variant so produced        by the host cell.

Suitably, the host cells, generally transformed or transfected asdescribed above, are cultured in a suitable nutrient medium underconditions permitting the expression of a DNA encoding a monomericeqFP611 variant. After completion of the culture, the cells may berecovered by centrifugal separation, and the recovered cells may besuspended in a water type buffer. Thereafter, the cells are typicallydisintegrated using an ultrasonic disintegrator or the like, so as toobtain a cell-free extract. A supernatant is obtained by centrifugingthe cell-free extract, and then, a purified sample can be obtained fromthe supernatant by applying, singly or in combination. For instance, thefollowing ordinary protein isolation and purification methods may beapplied: the solvent extraction, the salting-out method using ammoniumsulfate or the like, the desalting method, the precipitation methodusing an organic solvent, the anion exchange chromatography using resinssuch as diethylaminoethyl (DEAE) sepharose, the cation exchangechromatography using resins such as S-Sepharose FF (manufactured byPharmacia), the hydrophobic chromatography using resins such as butylsepharose or phenyl sepharose, the gel filtration method using amolecular sieve, the affinity chromatography, the chromatofocusingmethod and electrophoresis such as isoelectric focusing.

The medium used to culture the cells may be any conventional mediumssuitable for growing the host cells, such as minimal or complex mediacontaining appropriate supplements. Suitable media are available fromcommercial suppliers or may be prepared according to published protocolswhich are known.

We also provide a method for selecting cells expressing a protein ofinterest comprising:

-   -   (a) introducing into cells a DNA encoding a protein of interest        and a DNA encoding a monomeric variant of the red fluorescent        protein eqFP611;    -   (b) cultivating the cells from (a) under conditions permitting        expression of the the monomeric variant and the protein of        interest; and    -   (c) selecting cultivated cells from (b) which express the        monomeric variant.

We also encompass a method for measuring, in particular determiningand/or monitoring, expression of a protein of interest in a cellcomprising:

-   -   (a) introducing into a cell a DNA encoding a monomeric variant        of the red fluorescent protein eqFP611 fused to a DNA encoding a        protein of interest the DNA being operably linked to and under        the control of an expression control sequence which moderates        expression of the protein of interest;    -   (b) cultivating the cell under conditions suitable for the        expression of the protein of interest; and    -   (c) detecting the fluorescence emission of the monomeric variant        as a means for measuring, in particular determining and/or        monitoring, the expression of the protein of interest.

The term “operably linked” means that the regulatory sequences necessaryfor expression of the coding sequence are placed in the DNA molecule inthe appropriate positions relative to the coding sequence so as toeffect expression of the coding sequence. The same definition issometimes applied to the arrangement of coding sequences andtranscription control elements (EG promoters, enhances, and terminationelements) in an expression vector. This definition is also sometimesapplied to the arrangement of nucleic acid sequences of a first and asecond nucleic acid molecule wherein a highbred nucleic acid molecule isgenerated.

With respect to the above-mentioned method, the monomeric eqFP611variants act as a fluorescent labeling substances. This is to say, themonomeric eqFP611 variant is preferably purified as a fusion proteinwith a protein of interest, and the fusion protein is introduced intocells by methods such as the microinjection. By observing thedistribution of the fusion protein over time, targeting activity of theprotein of interest can be detected in the cells.

As already mentioned, the type of the protein of interest with which themonomeric eqFP611 variant is fused is not particularly limited.Preferred examples may include proteins localizing in cells, proteinsspecific for intracellular structures, in particular organelles, andtargeting signals (e.g., a nuclear transport signal a mitochondrialpresequence and the like). For example, by using a protein specific foran intracellular structure in particular intracellular organella, as aprotein of interest, the distribution and movement of nucleus,mitochondria, an endoplasmic reticulum, a Golgi body, a secretoryvesicle, a peroxisome or the like can be observed.

Accordingly, we provide a method for determining the localization, inparticular intra- and/or extracellular localization, of a protein ofinterest comprising:

-   -   (a) introducing into a cell a DNA encoding a monomeric variant        of the red fluorescent protein eqFP611 fused to a DNA encoding a        protein of interest the DNA being operably linked to and under        the control of a suitable expression control sequence;    -   (b) cultivating the cell under conditions suitable for the        expression of the protein of interest; and    -   (c) determining the localization of the protein of interest by        detecting the fluorescence emission of the monomeric variant by        optical means.

We further provide a method for localizing a subcellular structure in acell comprising:

-   -   (a) introducing into a cell a DNA encoding a monomeric variant        of the red fluorescent protein eqFP611 fused to a DNA encoding a        protein or peptide specific for a subcellular structure the DNA        preferably being operably linked to and under the control of an        expression control sequence which moderates expression of the        protein or the peptide specific for a subcellular structure;    -   (b) cultivating the cell under conditions suitable for the        expression of the protein or the peptide specific for a        subcellular structure; and    -   (c) detecting the fluorescence emission of the monomeric variant        as a means for localizing the subcellular structure in the cell.

Suitable proteins being specific for cellular substructures, inparticular organelles are proteins being specific for endoplasmicreticulum, peroxisomes, nucleus, in particular organelles with pH valuesdeviating from 7. Alternatively, the peptide specific for a subcellularstructure may be a targeting signal, in particular cellularesubstructures, for instance peroxisomes, endoplasmic reticulum, nucleus,mitochondria and the like. To this means, a monomeric eqFP611 variantmay be fused to the targeting signal for labeling of cellularsubstructures, in particular for labeling the aforementioned organelles.

Further, we provide a method of comparing the effect of one or more testsubstance(s) on the expression and/or localization of a protein ofinterest comprising:

-   -   (a) introducing into a cell a DNA encoding a monomeric variant        of the red fluorescent protein eqFP611 fused to a DNA encoding a        protein of interest the DNA being operably linked to and under        the control of a suitable expression control sequence;    -   (b) cultivating the cell under conditions suitable for the        expression of the protein of interest in the presence and        absence of the test substance(s);    -   (c) determining the expression and/or localization of the        protein of interest by detecting the fluorescence emission of        the monomeric variant by optical means; and    -   (d) comparing the fluorescence emission obtained in the presence        and absence of the test substance(s) to determine the effect of        the test substance(s) on the expression and/or localization of        the protein of interest.

Finally, a method determines promoter activity in a cell comprising:

-   -   (a) introducing into a cell a vector that is constructed such        that DNA encoding a monomeric variant of the red fluorescent        protein eqFP611 is located downstream of a promoter to be        tested; and    -   (b) detecting the fluorescence of the monomeric variant as a        means for determining the activity of the promoter.

The fluorescence of the monomeric eqFP611 variant can be detected with aviable cell or with fixed cells. Such detection can be carried outusing, for example, a fluorescence microscope (Axiophoto Filter Set 09manufactured by Carl Zeiss) or an image analyzer (Digital Image Analyzermanufactured by ATTO). The type of a microscope can be appropriatelyselected depending on purposes. Where frequent observation such aspursuit of a change over time is carried out, an ordinary incident-lightfluorescence microscope is preferable. Where observation is carried outwhile resolution is emphasized, for example, in the case of searchinglocalization in cells specifically, a confocal laser scanning microscopeis preferable. In terms of maintenance of the physiological state ofcells and prevention from contamination, an inverted microscope ispreferable as microscope system. When an erecting microscope with ahigh-powered lens is used, a water immersion lens can be used. Anappropriate filter set can be selected depending on the fluorescencewavelength of the monomeric eqFP611 variant. For example, since thewavelength of maximum absorption of the monomeric eqFP611 variant havingthe amino acid sequence as set forth in SEQ ID No. 1 is 558 nm, and thewavelength of maximum fluorescence thereof is 605 nm, circa 550 to 610nm filter for fluorescence light can be used. Due to the large Stokesshift of the inventive monomeric eqFP611 variants, there is nocross-talk between the filters of a filter set when detecting thefluorescence of the monomeric eqFP611 variants.

Further, when viable cells are observed with time using a fluorescencemicroscope, photographing should be performed in a short time, andtherefore a high sensitivity cooled CCD camera is used. In the cooledCCD camera, thermal noise is reduced by cooling CCD, and very weakfluorescent images can be clearly photographed with short-time exposure.

Alternatively or in combination to the above described fluorescencemicroscopy, the fluorescence may be detected by tracking, quantifying,and sorting of cells labeled with a monomeric eqFP611 variant using flowcytometry or fluorescence activated cell sorting (FACS).

To sum up, we provide novel monomeric variants of the red fluorescentprotein eqFP611 exhibiting a large Stokes shift. This makes themonomeric variants in particular useful as labeling substances ormarkers (labels), preferably in biological and/or medicinal imaging.Preferably, the monomeric eqFP611 variants may be used as labelingsubstances for tracking cells, for instance in biological tissues or incell cultures. The cells themselves may be living cells. Suitable cellsmay include differentiated cells and stem cells, particularly embryonicstem cells. Due to the superior stability of the monomeric eqFP611variants upon fixation compounds, in particular paraformaldehyde, themonomeric variants of the tetrameric eqFP611 may also be used forlabeling fixed cells. As already mentioned, the monomeric eqFP611variants may further be used as labeling substances for selecting cellsexpressing a protein of interest. Furthermore, the monomeric eqFP611variants may be used as labeling substances for measuring, in particulardetermining and/or monitoring, the expression, preferably the tissuespecific expression, of a protein of interest. A further field ofapplication of the monomeric eqFP611 variants relates to their use aslabeling substances for determining and/or monitoring of celldifferentiation, in particular stem cell differentiation, preferablyembryonic stem cell differentiation. As already mentioned before, themonomeric eqFP611 variants may further be used as labeling substancesfor localization, in particular intra- and/or extracellularlocalization, of a protein of interest or for localization of asubcellular structure in a cell. Besides, the monomeric variants ofeqFP611 may be used as markers for determining the promoter activity ina cell. A further area of application covers the use the inventivemonomeric eqFP611 variants as labeling substances for measuring, inparticular determining and/or monitoring, the gene expressions incellular assays for screening of chemical compounds.

A preferred area of application relates to the use of the monomericeqFP611 variants for diagnosing and/or monitoring of diseases.Preferably, the monomeric eqFP611 variants may be used in whole bodyimaging. For instance, the monomeric variants are used for monitoring oftumor development and/or for tracking of metastases. Furthermore, themonomeric eqFP611 variants may be used for determining and/or monitoringof peroxisomal biogenesis disorders.

The aforementioned fields of application impressively corroborate theversatility of the monomeric eqFP611 variants in the field of biologyand medicine.

EXAMPLES Example 1 Mutagenesis and Screening

The coding cDNA of tetrameric eqFP611 and its monomeric variants wereligated in the pQE-32 (Qiagen, Hilden, Germany). Monomerizing interfacemutations were introduced by site-directed mutagenesis via overlapextension PCR (25). Random mutagenesis was performed according to themanufacturer's protocol using the Diversify PCR Random Mutagenesis Kit(Clontech Laboratories. Inc., Mountain View, Calif., USA), in conditionsoptimal for four to five mutations per 1000 bp. QuikChange MultiSite-Directed Mutagenesis Kit (Stratagene, La Jolla, Calif., USA) wasused subsequently to spot mutations that act synergistically to promotefolding. For this purpose, up to six oligonucleotide primers wereapplied in the mutagenesis reaction to a mixture of up to four differentmutant templates. E. coli XL1-Blue or XL10-Gold® ultracompetent cells(Stratagene, La Jolla, Calif., USA) were transformed with the mutantplasmid libraries and bacterial colonies on agar plates were screenedfor fluorescence after overnight incubation at 37° C. using a UVA (365nm) transilluminator (Benda, Wiesloch, Germany) and a fluorescencemicroscope (Axioplan I, Carl Zeiss Jena GmbH, Jena, Germany).

Example 2 Protein Expression, Purification and Gel FiltrationExperiments

The eqFP61 l-variants carrying an N-terminal poly-His tag were expressedin E. coli M15 [pREP4](Qiagen, Hilden, Germany). The proteins werepurified using Talon™ metal affinity resin (Clontech Laboratories. Inc.,Mountain View, Calif., USA) and gel filtration (Superdex 200, ÄktaSystem, GE Healthcare Biosciences, Little Chalfont, UK) as described (4,26).

Example 3 Determination of Maturation Times

Maturation times at 37° C. were determined as described (4). The proteinsolutions were diluted to ensure an OD_(˜560)˜0.1 after maturation toavoid artificially shortened maturation times due to inner filtereffects. Maturation time courses at 37° C. were recorded with a CaryEclipse Spectrofluorometer (Varian Inc., Palo Alto, Calif., USA).

Example 4 Spectroscopic Characterization

Absorption spectra were recorded on a Cary 50 Spectrophotometer (VarianInc., Palo Alto, Calif., USA). Fluorescence emission and excitationspectra were measures with a Cary Eclipse Spectrofluorometer (VarianInc., Palo Alto, Calif., USA). The spectral characterization of themonomeric eqFP611 variants at pH extremes and photobleaching experimentswere conducted as described in literature (12). Fluorescence lifetimemeasurements on the monomeric eqFP611 variants were performed with aZeiss Axiovert 135 inverted microscope (Carl Zeiss Jena GmbH, Jena,Germany) by using a computer card for time-correlated single-photoncounting (TIMEHARP 200, Pico-Quant, Berlin, Germany) similar to previousreports (4). The excitation light pulses (532 nm) were generated bymode-locked, frequency-doubled solid-state laser (GE-100, Time BandwidthProducts, Zürich, Switzerland). The fluorescence quantum yield in 1×PBS,pH 7.0, was determined using eqFP611 (quantum yield of 0.45) as areference (4). Measurements of the molar extinction coefficient usingthe alkaline denaturation method (Ward, 2005) failed because of theextraordinary stability of the monomeric eqFP611 variants. Highconcentrations of 0.3-1.0 N NaOH had been applied to denature themonomeric eqFP611 variants and to calculate the chromophoreconcentration by using the extinction coefficient of denatured GFP(Green Fluorescent Protein) at pH 13 (27). After complete denaturation,not only was the acylimine bond of the red chromophore reduced, but asignificant fraction of the resulting GFP-chromophore was alsodestroyed, resulting in artificially high molar extinction coefficientsof up to 250,000 M⁻¹ cm⁻¹. Accordingly, the extinction coefficient ofthe monomeric eqFP611 variants were measured with the dynamic differencemethod.

Example 5 Vector Construction

For expression in mammalian cells, various monomeric eqFP611 variantswere cloned into the pcDNA3 vector (Invitrogen, Carlsbad, Calif., USA).Human α-tubulin from the pEGFP-Tub vector (Clontech Laboratories, Inc.,Mountain View, Calif., USA) was inserted into pcDNA3.1(−) downstream ofthe coding sequence of monomeric eqFP611 variants (Invitrogen, Carlsbad,Calif., USA). An endoplasmic reticulum retention signal (KDEL (SEQ IDNo. 7)) (28) was introduced to the C-terminus of the monomeric eqFP611variant and EGFP by PCR. The PCR products were inserted into theexpression vector pCI-leader, encoding the signal peptide(METDT-LLLW-VLLL-WVP-GSTGD (SEQ ID No. 10)) from the murine Igκ chain.

Example 6 Eukaryotic Expression and Imaging

HeLa and HEK293 cells were grown on chambered cover glasses (Nalge NuncInternational Corp., Rochester, N.Y., USA). Images were taken 24-48 hpost-transfection using fluorescene microscopes (DM IRB, LeicaMicrosystems, Wetzlar, Germany, IX71, Olympus, Hamburg, Germany)equipped with a digital camera (C4742, Hamamatsu, Japan), a 100-Wmercury lamp and standard FITC and TRITC filter sets. Subcellularlocalization of monomeric eqFP611 variants with peroxisomal targetingsignals (PTS) were assayed by confocal microscopy. HEK293 cells weretransfected with a pcDNA3 vector encoding monomeric eqFP611 variant witha peroxisomal targeting signal alone or in combination with a pcDNA3vector encoding EGFP-H2B. Cells were fixed, permeabilized, and useddirectly for microscopic analysis or after immunostaining with anantiserum directed against human α-tubulin and an Alexa Fluor®488-coupled secondary antibody. Cells were plated on circular quartzcoverslips (Leica Microsystems, Wetzlar, Germany). Sandwiches of twocoverslips were assembled with PBS/glycerol (13%/87% by volume) as amounting medium. Images were collected on a Leica TCS 4Pi scanningconfocal laser microscope (Leica Microsystems. Wetzlar. Germany) using a100×, NA 1.35, glycerol immersion objective and 488-nm excitation forEGFP and Alexa Fluor® 488, whereas monomeric eqFP611 variant was excitedat 561 nm. The emitted photos were registered by using two avalanchephotodiodes (APDs; SPCM-AQR-14. Perkin-Elmer, Canada). Amaximum-intensity projection of a 3D stack was prepared forpresentation.

TABLE OF REFERENCES

-   1. Shaner, N. C. et al. J. Cell Sci. 120, 4247-4260 (2007).-   2. Wiedenmann, J. Patent DE 197 18 640, 1-18 (1997).-   3. Matz, M. V. et al. Nat. Biotechnol. 17, 969-973 (1999).-   4. Wiedenmann, J. et al. Proc. Natl. Acad. Sci. USA 99, 11646-11651    (2002).-   5. Yarbrough, J. et al. Proc. Natl. Acad. Sci. USA 98, 463-467    (2001).-   6. Merzlyak, E. M. et al. Nat. Methods 4, 555-557 (2007).-   7. Campbell, R. E. et al. Proc. Natl. Acad. Sci. USA 99, 7877-7882    (2002).-   8. Shaner, N. C. et al. Nat. Biotechnol. 22, 1567-1572 (2004).-   9. Karasawa, S., et al. Biochem. J. 381, 307-312 (2004).-   10. Rizzo, M. A., et al. Nat. Biotechnol. 22, 445-449 (2004).-   11. Sheherbo, D. et al. Nat. Methods 4, 741-746 (2007).-   12. Wiedenmann, J. et al. J. Biomed. Opt. 10, 14003 (2005).-   13. Schrader, M. & Fahimi, H. D., Histochem. Cell Biol. (Epub ahead    of print) (2008).-   14. Schrader, M. & Yoon, Y., Bioessays 29, 1105-1114 (2008).-   15. Beaucage and Caruthers, Tetrahedron Letters 22, 1859-1869    (1981).-   16. Matthes et al., EMBO Journal 3, 801-805 (1984).-   17. Sambrook, J. et al., Molecular Cloning—A Laboratory Manual, Cold    Spring Harbor Laboratory Press (1989).-   18. Saiki et al., Science 239, 487-491 (1988).-   19. Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring    Harbor Laboratory, Cold Spring Harbor, N.Y., (1989)-   20. Current Protocols in Molecular Biology, Supplement 1 to 38, John    Wiley & Sons (1987-1997).-   21. Kaufman and Sharp, J. Mol. Biol., 159, 601-621 (1982).-   22. Southern and Berg, J. Mol. Appl. Genet., 1, 327-341 (1982).-   23. Loyter et al., Proc. Natl. Acad. Sci USA, (1982).-   24. Gleeson et al., J. Gen. Microbiol., 132, 3459-3465 (1986).-   25. Ho S N, Hunt H D, Horton R M, Pullen J K, Pease L R (1989)    Site-directed mutagenesis by overlap extension using the polymerase    chain reaction. Gene 77: 51-59.-   26. Wiedenmann J, Vallone B, Renzi F, Nienhaus K, Ivanchenko S, et    al. (2005) Red fluorescent protein eqFP611 and its genetically    engineered dimeric variants. Journal of Biomedical Optics 10: 14003.-   27. Ward W W (2005) Biochemical and Physical Properties of Green    Fluorescent Protein. In: Chalfie M, Kain S R, editors. Green    fluorescent protein: properties, applications and protocols. 2nd ed.    Hoboken, USA: Wiley and Sons. pp. 39-65.-   28. Munro S, Pelham H R (1987) A C-terminal signal prevents    secretion of luminal ER proteins. Cell 48: 899-907.

SUPPLEMENTAL TABLE 1 Peroxisomal localization of monomeric eqFP611variants with different C-terminal sequences in HEK293 cells.Peroxisomal localization^(a) SEQ  24 h post- 48 h post- ID  trans-trans- C-terminal sequence No. fection fection _216HAVAKFCDLPSKLGRL₂₃₁^(b) 11 ++++++ ++++++ _216HAVAKHSGLL ₂₂₅ 12 +++++− +++++− _216HAVAKHSGL₂₂₄ 13 +++−−− +++−−− _216HAVAKFAGL ₂₂₄ 14 −−−−−− ++−−−− _216HAEAQFSGLL₂₂₅ 15 −−−−−− +−−−−− _216HAVAKHSG ₂₂₃ 16 −−−−−− −−−−−− _216HAVAKHSGGG₂₂₅ 17 −−−−−− −−−−−− _216HAEAQFSG ₂₂₃ 18 −−−−−− −−−−−− _216HAVAKFCDL ₂₂₄19 −−−−−− −−−−−− _216HAVAKFAGGG ₂₂₅ 20 −−−−−− −−−−−− _216HAVAKFAGLGGG₂₂₇ ^(c) 21 −−−−−− −−−−−− ^(a)Rating based on the co-localization withEGFP-SKL from complete (++++++) to absent (−−−−−−) ^(b)C-terminalsequence of wildtype eqFP611 ^(c)C-terminal sequence of monomeric eqFPvariants as set forth in SEQ ID No. 1

SUPPLEMENTAL TABLE 2 Fluorescence properties of monomeric eqFP variantsas set forth in SEQ ID No. 1, its predecessors and other monomericfluorescent proteins emitting in the orange to far-red spectral region.Excitation Emission Maximum Maximum Stokes E_(mol) Relative FP Variant[nm] [nm] shift QY [M⁻¹ cm⁻¹] Brightness^(d) mKO (Karasawa et al., 2004)548 559 11 0.6  51,600^(c) 1.0^(c) mOrange (Shaner et al., 2004) 548 56214 0.69  71,000^(a) 1.5^(a) tagRFP (Merzlyak et al., 2007) 555 584 290.48 100,000^(a) 1.5^(a) mRuby 558 605 47 0.35 not applicable^(a)1.2^(b) 112,000^(b) eqFP611 (Wiedenmann et al., 2002) 559 611 52 0.45116,000^(a) 1.6^(a) 146,000^(b) 2.1^(b) RFP611 559 611 52 0.48120,000^(a) 1.8^(a) 151,000^(b) 2.3^(b) mCherry (Shaner et al., 2004)587 615 28 0.22  72,000^(a) 0.5^(a) mRaspberry (Wang et al., 2004) 598625 27 0.15  86,000^(a) 0.4^(a) mKate (Sheherbo et al., 2007) 588 635 470.33  45,000^(a) 0.5^(a) mPlum (Wang et al., 2004) 590 649 59 0.1 41,000^(a) 0.1^(a)   143,400^(b,c) 0.4^(b) ^(a)Concentration of the redchromophore determined by the alkaline denaturation method (Gross etal., 2000). ^(b)Concentration of the red chromophore determined by thedynamic difference method. ^(c)Concentration of the red chromophorededuced from the protein concentration as determined by the Bradfordtest. ^(d)Product of QY and E_(mol) compared to the brightness of EGFP(53,000 M⁻¹ cm⁻¹ × 0.6) (Patterson et al., 1997).

Sequences (in accordance with the annexed sequence protocol)SEQ 1D No. 1 (denoted as mRuby):MNSLIKENMRMKVVLEGSVNGHQFKCTGEGEGNPYMGTQTMRIKVIEGGPLPFAFDILATSFMYGSRTFIKYPKGIPDFFKQSFPEGFTWERVTRYEDGGVITVMQDTSLEDGCLVYHAQVRGVNFPSNGAVMQKKTKGWEPNTEMMYPADGGLRGYTHMALKVDGGGHLSCSFVTTYRSKKTVGNIKMPGIHAVDHRLERLEESDNEMFVVQREHAVAKFAGLGGG. SEQ ID No. 2 (denoted as mRuby):ATGAATTCACTGATCAAGGAAAATATGCGTATGAAGGTGGTCCTGGAAGGTTCGGTCAACGGCCACCAATTCAAATGCACAGGTGAAGGAGAAGGCAATCCGTACATGGGAACTCAAACCATGAGGATCAAAGTCATCGAGGGAGGACCCCTGCCATTTGCCTTTGACATTCTTGCCACGAGCTTCATGTATGGCAGCCGTACTTTTATCAAGTACCCGAAAGGCATTCCTGATTTCTTTAAACAGTCCTTTCCTGAGGGTTTTACTTGGGAAAGAGTTACAAGATACGAAGATGGTGGAGTCATTACCGTTATGCAGGACACCAGCCTTGAGGATGGCTGTCTCGTTTACCACGCCCAAGTCAGGGGGGTAAACTTTCCCTCCAATGGTGCCGTGATGCAGAAGAAGACCAAGGGTTGGGAGCCTAATACAGAGATGATGIATCCAGCAGATGGTGGTCTGAGGGGATACACTCATATGGCACTGAAAGTTGATGGTGGTGGCCATCTGTCTTGCTCTTTCGTAACAACTTACAGGTCAAAAAAGACCGTCGGGAACATCAAGATGCCCGGTATCCATGCCGTTGATCACCGCCTGGAAAGGTTAGAGGAAAGTGACAATGAAATGTTCGTAGTACAACGCGAACACGCAGTTGCCAAGTTCGCCGGGCTTGGTGGTGGTTAASEQ ID No. 3 (denoted as mRuby, codon optimized variant):MNSLIKENMRMKVVLEGSVNGHQFKCTGEGEGNPYMGTQTMRIKVIEGGPLPFAFDILATSFMYGSRTFIKYPKGIPDFFKQSFPEGFTWERVTRYEDGGVITVMQDTSLEDGCLVYHAQVRGVNFPSNGAVMQKKTKGWEPNTEMMYPADGGLRGYTHMALKVDGGGHLSCSFVTTYRSKKTVGNIKMPGIHAVDHRLERLEESDNEMFVVQREHAVAKFAGLGGG*SEQ ID No. 4 (denoted as mRuby, codon optimized variant):ATGAACAGCCTGATCAAAGAAAACATGCGGATGAAGGTGGTGCTGGAAGGCAGCGTGAACGGCCACCAGTTCAAGTGCACCGGCGAGGGCGAGGGCAACCCCTACATGGGCACCCAGACCATGCGGATCAAAGTGATCGAGGGCGGACCTCTGCCCTTCGCCTTCGACATCCTGGCCACATCCTTCATGTACGGCAGCCGGACCTTCATCAAGTACCCCAAGGGCATCCCCGATTTCTTCAAGCAGAGCTTCCCCGAGGGCTTCACCTGGGAGAGAGTGACCAGATACGAGGACGGCGGCGTGATCACCGTGATGCAGGACACCAGCCTGGAAGATGGCTGCCTGGTGTACCATGCCCAGGTCAGGGGCGTGAATTTTCCCAGCAACGGCGCCGTGATGCAGAAGAAAACCAAGGGCTGGGAGCCCAACACCGAGATGATGTACCCCGCTGACGGCGGACTGAGAGGCTACACCCACATGGCCCTGAAGGTGGACGGCGGAGGGCACCTGAGCTGCAGCTTCGTGACCACCTACCGGTCCAAGAAAACCGTGGGCAACATCAAGATGCCCGGCATCCACGCCGTGGACCACCGGCTGGAAAGGCTGGAAGAGTCCGACAACGAGATGTTCGTGGTGCAGCGGGAGCACGCCGTGGCCAAGTTCGCCGGCCTGGGCGGAGGGTGASEQ ID No. 5 (denoted as mRuby, with peroxisomal targeting signal):MNSLIKENMRMKVVLEGSVNGHQFKCTGEGEGNPYMGTQTMRIKVIEGGPLPFAFDILATSFMYGSRTFIKYPKGIPDFFKQSFPEGFTWERVTRYEDGGVITVMQDTSLEDGCLVYHAQVRGVNFPSNGAVMQKKTKGWEPNTEMMYPADGGLRGYTHMALKVDGGGHLSCSFVTTYRSKKTVGNIKMPGIHAVDHRLERLEESDNEMFVVQREHAVAKFCDLPSKLGRLSEQ ID No. 6 (denoted as mRuby, with peroxisomal targeting signal):ATGAATTCACTGATCAAGGAAAATATGCGTATGAAGGTGGTCCTGGAAGGTTCGGTCAACGGCCACCAATTCAAATGCACAGGTGAAGGAGAAGGCAATCCGTACATGGGAACTCAAACCATGAGGATCAAAGTCATCGAGGGAGGACCCCTGCCATTTGCCTTTGACATTCTTGCCACGAGCTTCATGTATGGCAGCCGTACTTTTATCAAGTACCCGAAAGGCATTCCTGATTTCTTTAAACAGTCCTTTCCTGAGGGTTTTACTTGGGAAAGAGTTACAAGATACGAAGATGGTGGAGTCATTACCGTTATGCAGGACACCAGCCTTGAGGATGGCTGTCTCGTTTACCACGCCCAAGTCAGGGGGGTAAACTTTCCCTCCAATGGTGCCGTGATGCAGAAGAAGACCAAGGGTTGGGAGCCTAATACAGAGATGATGTATCCAGCAGATGGTGGTCTGAGGGGATACACTCATATGGCACTGAAAGTTGATGGTGGTGGCCATCTGTCTTGCTCTTTCGTAACAACTTACAGGTCAAAAAAGACCGTCGGGAACATCAAGATGCCCGGTATCCATGCCGTTGATCACCGCCTGGAAAGGTTAGAGGAAAGTGACAATGAAATGTTCGTAGTACAACGCGAACACGCAGTTGCCAAGTTCTGTGACCTTCCATCCAAACTGGGACGTCTTTGA

The invention claimed is:
 1. A DNA encoding a monomeric variant of redfluorescent protein eqFP611, wherein the monomeric variant comprises anamino acid sequence selected from the group consisting of SEQ ID NO: 1and SEQ ID NO:
 5. 2. A DNA comprising a nucleotide sequence selectedfrom the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, and SEQ ID NO:6.
 3. An expression vector comprising expression control sequences andthe DNA according to claim 1, wherein the expression control sequencesare operably linked to the DNA.
 4. An expression vector comprisingexpression control sequences and the DNA according to claim 2, whereinthe expression control sequences are operably linked to the DNA.
 5. Anisolated host cell transformed or transfected with the DNA according toclaim
 1. 6. An isolated host cell transformed or transfected with theDNA according to claim
 2. 7. An isolated host cell transformed ortransfected with the expression vector according to claim
 3. 8. Theisolated host cell according to claim 5, selected from the groupconsisting of a mammalian cell, a bacterial cell, a yeast cell, and aninsect cell.
 9. The isolated host cell according to claim 7, selectedfrom the group consisting of a mammalian cell, a bacterial cell, a yeastcell, and an insect cell.
 10. A fluorescent reagent kit comprising atleast one of: a) a DNA comprising the nucleotide sequence of SEQ ID NO:2, 4, or 6, b) an expression vector comprising a DNA comprising thenucleotide sequence of SEQ ID NO: 2, 4, or 6, and c) an isolated hostcell transformed or transfected with a DNA comprising the nucleotidesequence of SEQ ID NO: 2, 4, or
 6. 11. A method of preparing a monomericvariant of red fluorescent protein eqFP611 comprising: (a) cultivatingthe isolated host cell according to claim 5 so that the host cellproduces a monomeric variant of the red fluorescent protein eqFP611,wherein the monomeric variant comprises the amino acid sequence of SEQID NO: 1 or SEQ ID NO: 5; and (b) isolating and purifying said monomericvariant so produced by the host cell.
 12. A method of preparing amonomeric variant of red fluorescent protein eqFP611 comprising: (a)cultivating the isolated host cell according to claim 7 so that the hostcell produces a monomeric variant of the red fluorescent proteineqFP611, wherein the monomeric variant comprises the amino acid sequenceof SEQ ID NO: 1 or SEQ ID NO: 5; and (b) isolating and purifying saidmonomeric variant so produced by the host cell.
 13. A method ofpreparing a monomeric variant of red fluorescent protein eqFP611comprising: (a) cultivating the isolated host cell according to claim 8so that the host cell produces a monomeric variant of the redfluorescent protein eqFP611, wherein the monomeric variant comprises theamino acid sequence of SEQ ID NO: 1 or SEQ ID NO: 5; and (b) isolatingand purifying said monomeric variant so produced by the host cell.